Making Views Self-Maintainable for Data Warehousing
نویسندگان
چکیده
A data warehouse stores materialized views over data from one or more sources in order to provide fast access to the integrated data, regardless of the availability of the data sources. Warehouse views need to be maintained in response to changes to the base data in the sources. Except for very simple views, maintaining a warehouse view requires access to data that is not available in the view itself. Hence, to maintain the view, one either has to query the data sources or store auxiliary data in the warehouse. We show that by using key and referential integrity constraints, we often can maintain a select-project-join view when there are insertions, deletions, and updates to the base relations without going to the data sources or replicating the base relations in their entirety in the warehouse. We derive a set of auxiliary views such that the warehouse view and the auxiliary views together are self-maintainable|they can be maintained without going to the data sources or replicating all base data. In addition, our technique can be applied to simplify traditional materialized view maintenance by exploiting key and referential integrity constraints.
منابع مشابه
Self-Maintainability of Relational Algebra Operators
The topic of data warehousing has attracted considerable attention, both from industrial and research perspectives, because of its importance in providing support for sophisticated decision analysis. In a data warehouse, the contents can be viewed as a set of views materialized from sources which are typically remote and heterogeneous. In order to keep the contents of a data warehouse consisten...
متن کاملMaking Aggregate Views Self-maintainable
Data in the warehouse can be seen as materialized views generated from the underlying multiple data sources. Materialized views are used to speed up query processing on large amounts of data. These views need to be maintained in response to updates in the source data. This is often done using incremental techniques that access data from underlying sources. In the data warehousing scenario, acce...
متن کاملSpeeding up View Maintenance Using Cheap Filters at the Warehouse
Optimizing incremental view maintenance is an important problem in data warehousing. One popular technique, view self-maintenance, proposed computing the required view updates without using the underlying base relations. Traditionally, self-maintainability of a view is determined solely based on the view definition, independent of the state of the base relations. However, this “compile-time” ap...
متن کاملMultiple-View Self-Maintenance in Data Warehousing Environments
A data warehouse materializes views derived from data that may not reside at the warehouse. Maintaining these views efficiently in response to base updates is difficult, since it may involve querying external sources where the base data reside. This paper considers the problem of view self-maintenance, where the views are maintained without using all the base data. Without full use of the base ...
متن کاملMaking Multiple Views Self-Maintainable in a Data Warehouse
A data warehouse collects and maintains a large amount of data from several distributed and heterogeneous data sources. Often the data is stored in the form of materialized views in order to provide fast access to the integrated data, regardless of the availability of the data sources. In this paper we focus on the following problem: for a given set of materialized select-project-join (SPJ) vie...
متن کامل